Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 113066 |
| Missing cells | 190746 |
| Missing cells (%) | 7.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 65.5 MiB |
| Average record size in memory | 607.7 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 9 |
| Boolean | 2 |
ListingKey has a high cardinality: 113066 distinct values | High cardinality |
ListingCreationDate has a high cardinality: 113064 distinct values | High cardinality |
ClosedDate has a high cardinality: 2802 distinct values | High cardinality |
LoanOriginalAmount is highly correlated with MonthlyLoanPayment | High correlation |
MonthlyLoanPayment is highly correlated with LoanOriginalAmount | High correlation |
BorrowerAPR is highly correlated with BorrowerRate | High correlation |
BorrowerRate is highly correlated with BorrowerAPR and 2 other fields | High correlation |
CreditScoreRangeLower is highly correlated with BorrowerRate and 1 other fields | High correlation |
CreditScoreRangeUpper is highly correlated with BorrowerRate and 1 other fields | High correlation |
LoanOriginalAmount is highly correlated with MonthlyLoanPayment | High correlation |
MonthlyLoanPayment is highly correlated with LoanOriginalAmount | High correlation |
BorrowerAPR is highly correlated with BorrowerRate | High correlation |
BorrowerRate is highly correlated with BorrowerAPR | High correlation |
CreditScoreRangeLower is highly correlated with CreditScoreRangeUpper | High correlation |
CreditScoreRangeUpper is highly correlated with CreditScoreRangeLower | High correlation |
IncomeVerifiable is highly correlated with DebtToIncomeRatio | High correlation |
DebtToIncomeRatio is highly correlated with IncomeVerifiable | High correlation |
LoanOriginalAmount is highly correlated with MonthlyLoanPayment | High correlation |
MonthlyLoanPayment is highly correlated with LoanOriginalAmount | High correlation |
BorrowerAPR is highly correlated with BorrowerRate | High correlation |
BorrowerRate is highly correlated with BorrowerAPR | High correlation |
CreditScoreRangeLower is highly correlated with CreditScoreRangeUpper | High correlation |
CreditScoreRangeUpper is highly correlated with CreditScoreRangeLower | High correlation |
LoanStatus is highly correlated with Term and 1 other fields | High correlation |
Term is highly correlated with LoanStatus | High correlation |
LoanOriginalAmount is highly correlated with MonthlyLoanPayment and 1 other fields | High correlation |
MonthlyLoanPayment is highly correlated with LoanOriginalAmount | High correlation |
BorrowerAPR is highly correlated with BorrowerRate and 2 other fields | High correlation |
BorrowerRate is highly correlated with BorrowerAPR and 2 other fields | High correlation |
CreditGrade is highly correlated with LoanOriginalAmount and 4 other fields | High correlation |
ProsperRating (Alpha) is highly correlated with BorrowerAPR and 1 other fields | High correlation |
CreditScoreRangeLower is highly correlated with CreditGrade and 1 other fields | High correlation |
CreditScoreRangeUpper is highly correlated with CreditGrade and 1 other fields | High correlation |
IncomeRange is highly correlated with EmploymentStatus | High correlation |
IncomeVerifiable is highly correlated with DebtToIncomeRatio and 1 other fields | High correlation |
DebtToIncomeRatio is highly correlated with IncomeVerifiable | High correlation |
EmploymentStatus is highly correlated with LoanStatus and 2 other fields | High correlation |
ClosedDate has 57990 (51.3%) missing values | Missing |
CreditGrade has 84113 (74.4%) missing values | Missing |
ProsperRating (Alpha) has 29084 (25.7%) missing values | Missing |
DebtToIncomeRatio has 8472 (7.5%) missing values | Missing |
EmploymentStatus has 2255 (2.0%) missing values | Missing |
EmploymentStatusDuration has 7625 (6.7%) missing values | Missing |
StatedMonthlyIncome is highly skewed (γ1 = 125.0987676) | Skewed |
df_index is uniformly distributed | Uniform |
ListingKey is uniformly distributed | Uniform |
ListingCreationDate is uniformly distributed | Uniform |
df_index has unique values | Unique |
ListingKey has unique values | Unique |
ListingCategory (numeric) has 16965 (15.0%) zeros | Zeros |
StatedMonthlyIncome has 1394 (1.2%) zeros | Zeros |
EmploymentStatusDuration has 1503 (1.3%) zeros | Zeros |
Reproduction
| Analysis started | 2022-08-25 16:15:53.172328 |
|---|---|
| Analysis finished | 2022-08-25 16:16:41.483769 |
| Duration | 48.31 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 113066 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56854.43196 |
| Minimum | 0 |
|---|---|
| Maximum | 113936 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5657.25 |
| Q1 | 28353.25 |
| median | 56781.5 |
| Q3 | 85339.75 |
| 95-th percentile | 108209.75 |
| Maximum | 113936 |
| Range | 113936 |
| Interquartile range (IQR) | 56986.5 |
Descriptive statistics
| Standard deviation | 32897.80264 |
|---|---|
| Coefficient of variation (CV) | 0.5786321576 |
| Kurtosis | -1.200343729 |
| Mean | 56854.43196 |
| Median Absolute Deviation (MAD) | 28493 |
| Skewness | 0.004341561274 |
| Sum | 6428303204 |
| Variance | 1082265418 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 75811 | 1 | < 0.1% |
| 75822 | 1 | < 0.1% |
| 75821 | 1 | < 0.1% |
| 75820 | 1 | < 0.1% |
| 75819 | 1 | < 0.1% |
| 75818 | 1 | < 0.1% |
| 75817 | 1 | < 0.1% |
| 75816 | 1 | < 0.1% |
| 75815 | 1 | < 0.1% |
| Other values (113056) | 113056 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 113936 | 1 | |
| 113935 | 1 | |
| 113934 | 1 | |
| 113933 | 1 | |
| 113932 | 1 | |
| 113931 | 1 | |
| 113930 | 1 | |
| 113929 | 1 | |
| 113928 | 1 | |
| 113927 | 1 |
| Distinct | 113066 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.6 MiB |
| 1021339766868145413AB3B | 1 |
|---|---|
| F0663582993853438C8A8E0 | 1 |
| 66983585151599608A5ABC6 | 1 |
| 66953459735530674736867 | 1 |
| 6693339801389188068FB4E | 1 |
| Other values (113061) |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 2600518 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 113066 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1021339766868145413AB3B |
|---|---|
| 2nd row | 10273602499503308B223C1 |
| 3rd row | 0EE9337825851032864889A |
| 4th row | 0EF5356002482715299901A |
| 5th row | 0F023589499656230C5E3E2 |
Common Values
| Value | Count | Frequency (%) |
| 1021339766868145413AB3B | 1 | < 0.1% |
| F0663582993853438C8A8E0 | 1 | < 0.1% |
| 66983585151599608A5ABC6 | 1 | < 0.1% |
| 66953459735530674736867 | 1 | < 0.1% |
| 6693339801389188068FB4E | 1 | < 0.1% |
| 6AB735643902836208D76F3 | 1 | < 0.1% |
| 6AAB34147517188050BD961 | 1 | < 0.1% |
| 6AA0359887299121671E424 | 1 | < 0.1% |
| F7E3359743115455359D06A | 1 | < 0.1% |
| F7E13468604552859BD924B | 1 | < 0.1% |
| Other values (113056) | 113056 |
Length
| Value | Count | Frequency (%) |
| 1021339766868145413ab3b | 1 | < 0.1% |
| 0ffc35866018516621b0d3f | 1 | < 0.1% |
| 0f1035772717087366f9ea7 | 1 | < 0.1% |
| 0f043596202561788ea13d5 | 1 | < 0.1% |
| 0f123545674891886d9f106 | 1 | < 0.1% |
| 0f1734025150298088a5f2b | 1 | < 0.1% |
| 0f1a3597143888805163ef7 | 1 | < 0.1% |
| 0f1c3583260311305d68f87 | 1 | < 0.1% |
| 0fbc3556025226720be6dd4 | 1 | < 0.1% |
| 0f353575943675863d1afc0 | 1 | < 0.1% |
| Other values (113056) | 113056 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 318863 | |
| 5 | 259829 | |
| 4 | 212165 | |
| 9 | 206809 | |
| 6 | 201381 | |
| 8 | 200419 | |
| 0 | 198411 | |
| 7 | 195121 | |
| 2 | 192853 | |
| 1 | 191444 | |
| Other values (6) | 423223 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2177295 | |
| Uppercase Letter | 423223 | 16.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 318863 | |
| 5 | 259829 | |
| 4 | 212165 | |
| 9 | 206809 | |
| 6 | 201381 | |
| 8 | 200419 | |
| 0 | 198411 | |
| 7 | 195121 | |
| 2 | 192853 | |
| 1 | 191444 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 70894 | |
| C | 70794 | |
| A | 70479 | |
| F | 70423 | |
| E | 70415 | |
| B | 70218 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2177295 | |
| Latin | 423223 | 16.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 318863 | |
| 5 | 259829 | |
| 4 | 212165 | |
| 9 | 206809 | |
| 6 | 201381 | |
| 8 | 200419 | |
| 0 | 198411 | |
| 7 | 195121 | |
| 2 | 192853 | |
| 1 | 191444 |
Latin
| Value | Count | Frequency (%) |
| D | 70894 | |
| C | 70794 | |
| A | 70479 | |
| F | 70423 | |
| E | 70415 | |
| B | 70218 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2600518 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 318863 | |
| 5 | 259829 | |
| 4 | 212165 | |
| 9 | 206809 | |
| 6 | 201381 | |
| 8 | 200419 | |
| 0 | 198411 | |
| 7 | 195121 | |
| 2 | 192853 | |
| 1 | 191444 | |
| Other values (6) | 423223 |
| Distinct | 113064 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.3 MiB |
| 2013-06-03 17:27:50.540000000 | 2 |
|---|---|
| 2012-10-20 12:21:46.333000000 | 2 |
| 2007-08-26 19:09:29.263000000 | 1 |
| 2012-01-08 05:15:06.027000000 | 1 |
| 2014-01-14 08:01:37.673000000 | 1 |
| Other values (113059) |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 28.96621442 |
| Min length | 19 |
Characters and Unicode
| Total characters | 3275094 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 113062 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | 2007-08-26 19:09:29.263000000 |
|---|---|
| 2nd row | 2014-02-27 08:28:07.900000000 |
| 3rd row | 2007-01-05 15:00:47.090000000 |
| 4th row | 2012-10-22 11:02:35.010000000 |
| 5th row | 2013-09-14 18:38:39.097000000 |
Common Values
| Value | Count | Frequency (%) |
| 2013-06-03 17:27:50.540000000 | 2 | < 0.1% |
| 2012-10-20 12:21:46.333000000 | 2 | < 0.1% |
| 2007-08-26 19:09:29.263000000 | 1 | < 0.1% |
| 2012-01-08 05:15:06.027000000 | 1 | < 0.1% |
| 2014-01-14 08:01:37.673000000 | 1 | < 0.1% |
| 2011-11-23 11:47:38.790000000 | 1 | < 0.1% |
| 2013-07-26 15:38:41.637000000 | 1 | < 0.1% |
| 2009-08-04 11:51:49.630000000 | 1 | < 0.1% |
| 2007-08-22 15:47:58.417000000 | 1 | < 0.1% |
| 2012-11-19 06:45:40.867000000 | 1 | < 0.1% |
| Other values (113054) | 113054 |
Length
| Value | Count | Frequency (%) |
| 2013-11-04 | 295 | 0.1% |
| 2013-12-03 | 271 | 0.1% |
| 2014-01-08 | 268 | 0.1% |
| 2013-12-02 | 265 | 0.1% |
| 2013-12-05 | 249 | 0.1% |
| 2013-09-16 | 244 | 0.1% |
| 2013-09-17 | 242 | 0.1% |
| 2013-12-04 | 242 | 0.1% |
| 2014-01-13 | 240 | 0.1% |
| 2014-01-15 | 239 | 0.1% |
| Other values (115411) | 223577 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1132112 | |
| 1 | 357676 | 10.9% |
| 2 | 303882 | 9.3% |
| - | 226132 | 6.9% |
| : | 226132 | 6.9% |
| 3 | 189101 | 5.8% |
| 7 | 126992 | 3.9% |
| 4 | 121271 | 3.7% |
| 113066 | 3.5% | |
| . | 112684 | 3.4% |
| Other values (4) | 366046 | 11.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2597080 | |
| Other Punctuation | 338816 | 10.3% |
| Dash Punctuation | 226132 | 6.9% |
| Space Separator | 113066 | 3.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1132112 | |
| 1 | 357676 | 13.8% |
| 2 | 303882 | 11.7% |
| 3 | 189101 | 7.3% |
| 7 | 126992 | 4.9% |
| 4 | 121271 | 4.7% |
| 5 | 112364 | 4.3% |
| 8 | 89284 | 3.4% |
| 6 | 83560 | 3.2% |
| 9 | 80838 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 226132 | |
| . | 112684 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 226132 |
Space Separator
| Value | Count | Frequency (%) |
| 113066 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3275094 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1132112 | |
| 1 | 357676 | 10.9% |
| 2 | 303882 | 9.3% |
| - | 226132 | 6.9% |
| : | 226132 | 6.9% |
| 3 | 189101 | 5.8% |
| 7 | 126992 | 3.9% |
| 4 | 121271 | 3.7% |
| 113066 | 3.5% | |
| . | 112684 | 3.4% |
| Other values (4) | 366046 | 11.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3275094 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1132112 | |
| 1 | 357676 | 10.9% |
| 2 | 303882 | 9.3% |
| - | 226132 | 6.9% |
| : | 226132 | 6.9% |
| 3 | 189101 | 5.8% |
| 7 | 126992 | 3.9% |
| 4 | 121271 | 3.7% |
| 113066 | 3.5% | |
| . | 112684 | 3.4% |
| Other values (4) | 366046 | 11.2% |
| Distinct | 2802 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 57990 |
| Missing (%) | 51.3% |
| Memory size | 5.8 MiB |
| 2014-03-04 00:00:00 | 105 |
|---|---|
| 2014-02-19 00:00:00 | 100 |
| 2014-02-11 00:00:00 | 92 |
| 2012-10-30 00:00:00 | 81 |
| 2013-02-26 00:00:00 | 78 |
| Other values (2797) |
Length
| Max length | 29 |
|---|---|
| Median length | 19 |
| Mean length | 19.00363135 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1046644 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 110 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2009-08-14 00:00:00 |
|---|---|
| 2nd row | 2009-12-17 00:00:00 |
| 3rd row | 2008-01-07 00:00:00 |
| 4th row | 2012-12-19 00:00:00 |
| 5th row | 2008-05-22 00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2014-03-04 00:00:00 | 105 | 0.1% |
| 2014-02-19 00:00:00 | 100 | 0.1% |
| 2014-02-11 00:00:00 | 92 | 0.1% |
| 2012-10-30 00:00:00 | 81 | 0.1% |
| 2013-02-26 00:00:00 | 78 | 0.1% |
| 2014-01-30 00:00:00 | 76 | 0.1% |
| 2014-01-14 00:00:00 | 75 | 0.1% |
| 2014-02-18 00:00:00 | 72 | 0.1% |
| 2014-02-24 00:00:00 | 72 | 0.1% |
| 2014-02-04 00:00:00 | 71 | 0.1% |
| Other values (2792) | 54254 | |
| (Missing) | 57990 |
Length
| Value | Count | Frequency (%) |
| 00:00:00 | 55056 | |
| 2014-03-04 | 105 | 0.1% |
| 2014-02-19 | 100 | 0.1% |
| 2014-02-11 | 92 | 0.1% |
| 2012-10-30 | 81 | 0.1% |
| 2013-02-26 | 78 | 0.1% |
| 2014-01-30 | 76 | 0.1% |
| 2014-01-14 | 75 | 0.1% |
| 2014-02-18 | 72 | 0.1% |
| 2014-02-24 | 72 | 0.1% |
| Other values (2793) | 54345 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 478194 | |
| - | 110152 | 10.5% |
| : | 110152 | 10.5% |
| 2 | 96515 | 9.2% |
| 1 | 92206 | 8.8% |
| 55076 | 5.3% | |
| 3 | 25479 | 2.4% |
| 9 | 18106 | 1.7% |
| 8 | 15827 | 1.5% |
| 7 | 13251 | 1.3% |
| Other values (4) | 31686 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 771244 | |
| Other Punctuation | 110172 | 10.5% |
| Dash Punctuation | 110152 | 10.5% |
| Space Separator | 55076 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 478194 | |
| 2 | 96515 | 12.5% |
| 1 | 92206 | 12.0% |
| 3 | 25479 | 3.3% |
| 9 | 18106 | 2.3% |
| 8 | 15827 | 2.1% |
| 7 | 13251 | 1.7% |
| 4 | 12329 | 1.6% |
| 6 | 9933 | 1.3% |
| 5 | 9404 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 110152 | |
| . | 20 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110152 |
Space Separator
| Value | Count | Frequency (%) |
| 55076 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1046644 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 478194 | |
| - | 110152 | 10.5% |
| : | 110152 | 10.5% |
| 2 | 96515 | 9.2% |
| 1 | 92206 | 8.8% |
| 55076 | 5.3% | |
| 3 | 25479 | 2.4% |
| 9 | 18106 | 1.7% |
| 8 | 15827 | 1.5% |
| 7 | 13251 | 1.3% |
| Other values (4) | 31686 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1046644 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 478194 | |
| - | 110152 | 10.5% |
| : | 110152 | 10.5% |
| 2 | 96515 | 9.2% |
| 1 | 92206 | 8.8% |
| 55076 | 5.3% | |
| 3 | 25479 | 2.4% |
| 9 | 18106 | 1.7% |
| 8 | 15827 | 1.5% |
| 7 | 13251 | 1.3% |
| Other values (4) | 31686 | 3.0% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.0 MiB |
| Current | |
|---|---|
| Completed | |
| Chargedoff | |
| Defaulted | 5018 |
| Past Due (1-15 days) | 800 |
| Other values (7) | 1465 |
Length
| Max length | 22 |
|---|---|
| Median length | 21 |
| Mean length | 8.357393027 |
| Min length | 7 |
Characters and Unicode
| Total characters | 944937 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Completed |
|---|---|
| 2nd row | Current |
| 3rd row | Completed |
| 4th row | Current |
| 5th row | Current |
Common Values
| Value | Count | Frequency (%) |
| Current | 55730 | |
| Completed | 38061 | |
| Chargedoff | 11992 | 10.6% |
| Defaulted | 5018 | 4.4% |
| Past Due (1-15 days) | 800 | 0.7% |
| Past Due (31-60 days) | 361 | 0.3% |
| Past Due (61-90 days) | 311 | 0.3% |
| Past Due (91-120 days) | 304 | 0.3% |
| Past Due (16-30 days) | 265 | 0.2% |
| FinalPaymentInProgress | 203 | 0.2% |
| Other values (2) | 21 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| current | 55730 | |
| completed | 38061 | |
| chargedoff | 11992 | 10.1% |
| defaulted | 5018 | 4.2% |
| past | 2057 | 1.7% |
| due | 2057 | 1.7% |
| days | 2057 | 1.7% |
| 1-15 | 800 | 0.7% |
| 31-60 | 361 | 0.3% |
| 61-90 | 311 | 0.3% |
| Other values (5) | 793 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 156353 | |
| r | 123858 | |
| C | 105788 | |
| t | 101069 | |
| u | 62805 | |
| d | 57133 | 6.0% |
| n | 56344 | 6.0% |
| o | 50256 | 5.3% |
| l | 43292 | 4.6% |
| m | 38264 | 4.0% |
| Other values (25) | 149775 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 809147 | |
| Uppercase Letter | 115732 | 12.2% |
| Decimal Number | 7716 | 0.8% |
| Space Separator | 6171 | 0.7% |
| Open Punctuation | 2057 | 0.2% |
| Close Punctuation | 2057 | 0.2% |
| Dash Punctuation | 2041 | 0.2% |
| Math Symbol | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 156353 | |
| r | 123858 | |
| t | 101069 | |
| u | 62805 | |
| d | 57133 | 7.1% |
| n | 56344 | 7.0% |
| o | 50256 | 6.2% |
| l | 43292 | 5.4% |
| m | 38264 | 4.7% |
| p | 38061 | 4.7% |
| Other values (8) | 81712 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3161 | |
| 0 | 1257 | 16.3% |
| 6 | 937 | 12.1% |
| 5 | 800 | 10.4% |
| 3 | 626 | 8.1% |
| 9 | 615 | 8.0% |
| 2 | 320 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 105788 | |
| D | 7075 | 6.1% |
| P | 2463 | 2.1% |
| F | 203 | 0.2% |
| I | 203 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 6171 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2057 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2057 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2041 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 924879 | |
| Common | 20058 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 156353 | |
| r | 123858 | |
| C | 105788 | |
| t | 101069 | |
| u | 62805 | |
| d | 57133 | 6.2% |
| n | 56344 | 6.1% |
| o | 50256 | 5.4% |
| l | 43292 | 4.7% |
| m | 38264 | 4.1% |
| Other values (13) | 129717 |
Common
| Value | Count | Frequency (%) |
| 6171 | ||
| 1 | 3161 | |
| ( | 2057 | 10.3% |
| ) | 2057 | 10.3% |
| - | 2041 | 10.2% |
| 0 | 1257 | 6.3% |
| 6 | 937 | 4.7% |
| 5 | 800 | 4.0% |
| 3 | 626 | 3.1% |
| 9 | 615 | 3.1% |
| Other values (2) | 336 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 944937 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 156353 | |
| r | 123858 | |
| C | 105788 | |
| t | 101069 | |
| u | 62805 | |
| d | 57133 | 6.0% |
| n | 56344 | 6.0% |
| o | 50256 | 5.3% |
| l | 43292 | 4.6% |
| m | 38264 | 4.0% |
| Other values (25) | 149775 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| 36 | |
|---|---|
| 60 | |
| 12 | 1614 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 226132 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 |
|---|---|
| 2nd row | 36 |
| 3rd row | 36 |
| 4th row | 36 |
| 5th row | 36 |
Common Values
| Value | Count | Frequency (%) |
| 36 | 87224 | |
| 60 | 24228 | 21.4% |
| 12 | 1614 | 1.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 36 | 87224 | |
| 60 | 24228 | 21.4% |
| 12 | 1614 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 111452 | |
| 3 | 87224 | |
| 0 | 24228 | 10.7% |
| 1 | 1614 | 0.7% |
| 2 | 1614 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 226132 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 111452 | |
| 3 | 87224 | |
| 0 | 24228 | 10.7% |
| 1 | 1614 | 0.7% |
| 2 | 1614 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 226132 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 111452 | |
| 3 | 87224 | |
| 0 | 24228 | 10.7% |
| 1 | 1614 | 0.7% |
| 2 | 1614 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 226132 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 111452 | |
| 3 | 87224 | |
| 0 | 24228 | 10.7% |
| 1 | 1614 | 0.7% |
| 2 | 1614 | 0.7% |
LoanOriginalAmount
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 2468 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8314.762307 |
| Minimum | 1000 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1500 |
| Q1 | 4000 |
| median | 6300 |
| Q3 | 12000 |
| 95-th percentile | 20000 |
| Maximum | 35000 |
| Range | 34000 |
| Interquartile range (IQR) | 8000 |
Descriptive statistics
| Standard deviation | 6237.007841 |
|---|---|
| Coefficient of variation (CV) | 0.7501125842 |
| Kurtosis | 1.331303374 |
| Mean | 8314.762307 |
| Median Absolute Deviation (MAD) | 3700 |
| Skewness | 1.224284612 |
| Sum | 940116915 |
| Variance | 38900266.81 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4000 | 14207 | 12.6% |
| 15000 | 12232 | 10.8% |
| 10000 | 10956 | 9.7% |
| 5000 | 6953 | 6.1% |
| 2000 | 6042 | 5.3% |
| 3000 | 5728 | 5.1% |
| 25000 | 3588 | 3.2% |
| 20000 | 3234 | 2.9% |
| 1000 | 3206 | 2.8% |
| 2500 | 2990 | 2.6% |
| Other values (2458) | 43930 |
| Value | Count | Frequency (%) |
| 1000 | 3206 | |
| 1001 | 8 | < 0.1% |
| 1005 | 2 | < 0.1% |
| 1010 | 1 | < 0.1% |
| 1025 | 33 | < 0.1% |
| 1030 | 6 | < 0.1% |
| 1031 | 2 | < 0.1% |
| 1032 | 1 | < 0.1% |
| 1035 | 1 | < 0.1% |
| 1036 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 418 | |
| 34999 | 3 | < 0.1% |
| 34700 | 1 | < 0.1% |
| 34679 | 1 | < 0.1% |
| 34000 | 5 | < 0.1% |
| 33750 | 2 | < 0.1% |
| 33710 | 1 | < 0.1% |
| 33500 | 2 | < 0.1% |
| 33411 | 1 | < 0.1% |
| 33000 | 5 | < 0.1% |
MonthlyLoanPayment
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 23567 |
|---|---|
| Distinct (%) | 20.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 271.9327422 |
| Minimum | 0 |
|---|---|
| Maximum | 2251.51 |
| Zeros | 935 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 52.55 |
| Q1 | 130.95 |
| median | 217.37 |
| Q3 | 370.57 |
| 95-th percentile | 633.655 |
| Maximum | 2251.51 |
| Range | 2251.51 |
| Interquartile range (IQR) | 239.62 |
Descriptive statistics
| Standard deviation | 192.5499791 |
|---|---|
| Coefficient of variation (CV) | 0.7080794225 |
| Kurtosis | 3.154029249 |
| Mean | 271.9327422 |
| Median Absolute Deviation (MAD) | 109.29 |
| Skewness | 1.41627881 |
| Sum | 30746347.43 |
| Variance | 37075.49443 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 173.71 | 2423 | 2.1% |
| 0 | 935 | 0.8% |
| 172.76 | 530 | 0.5% |
| 86.85 | 472 | 0.4% |
| 174.2 | 460 | 0.4% |
| 130.28 | 370 | 0.3% |
| 163.28 | 285 | 0.3% |
| 326.62 | 280 | 0.2% |
| 136.98 | 277 | 0.2% |
| 165.15 | 271 | 0.2% |
| Other values (23557) | 106763 |
| Value | Count | Frequency (%) |
| 0 | 935 | |
| 0.15 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.23 | 1 | < 0.1% |
| 0.24 | 1 | < 0.1% |
| 0.29 | 1 | < 0.1% |
| 0.44 | 1 | < 0.1% |
| 0.53 | 1 | < 0.1% |
| 0.58 | 1 | < 0.1% |
| 0.92 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2251.51 | 1 | |
| 2218.53 | 1 | |
| 2179.22 | 1 | |
| 2163.63 | 1 | |
| 2153.38 | 1 | |
| 2147.64 | 1 | |
| 2134.06 | 1 | |
| 2111.78 | 1 | |
| 1808.84 | 1 | |
| 1781.28 | 1 |
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.776838307 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 16965 |
| Zeros (%) | 15.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 13 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.998187782 |
|---|---|
| Coefficient of variation (CV) | 1.439834567 |
| Kurtosis | 5.823824174 |
| Mean | 2.776838307 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.443517343 |
| Sum | 313966 |
| Variance | 15.98550554 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 57624 | |
| 0 | 16965 | 15.0% |
| 7 | 10448 | 9.2% |
| 2 | 7388 | 6.5% |
| 3 | 7157 | 6.3% |
| 6 | 2568 | 2.3% |
| 4 | 2395 | 2.1% |
| 13 | 1987 | 1.8% |
| 15 | 1507 | 1.3% |
| 18 | 882 | 0.8% |
| Other values (11) | 4145 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 16965 | 15.0% |
| 1 | 57624 | |
| 2 | 7388 | 6.5% |
| 3 | 7157 | 6.3% |
| 4 | 2395 | 2.1% |
| 5 | 756 | 0.7% |
| 6 | 2568 | 2.3% |
| 7 | 10448 | 9.2% |
| 8 | 196 | 0.2% |
| 9 | 85 | 0.1% |
| Value | Count | Frequency (%) |
| 20 | 762 | 0.7% |
| 19 | 764 | 0.7% |
| 18 | 882 | |
| 17 | 52 | < 0.1% |
| 16 | 304 | 0.3% |
| 15 | 1507 | |
| 14 | 863 | |
| 13 | 1987 | |
| 12 | 58 | 0.1% |
| 11 | 214 | 0.2% |
| Distinct | 6677 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 25 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2189803536 |
| Minimum | 0.00653 |
|---|---|
| Maximum | 0.51229 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0.00653 |
|---|---|
| 5-th percentile | 0.09434 |
| Q1 | 0.15629 |
| median | 0.20984 |
| Q3 | 0.28386 |
| 95-th percentile | 0.35797 |
| Maximum | 0.51229 |
| Range | 0.50576 |
| Interquartile range (IQR) | 0.12757 |
Descriptive statistics
| Standard deviation | 0.08048277631 |
|---|---|
| Coefficient of variation (CV) | 0.3675342331 |
| Kurtosis | -0.8839274884 |
| Mean | 0.2189803536 |
| Median Absolute Deviation (MAD) | 0.06233 |
| Skewness | 0.2209129987 |
| Sum | 24753.75815 |
| Variance | 0.006477477283 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.35797 | 3672 | 3.2% |
| 0.35643 | 1644 | 1.5% |
| 0.37453 | 1260 | 1.1% |
| 0.30532 | 902 | 0.8% |
| 0.2951 | 747 | 0.7% |
| 0.35356 | 715 | 0.6% |
| 0.29776 | 707 | 0.6% |
| 0.15833 | 642 | 0.6% |
| 0.24246 | 605 | 0.5% |
| 0.24758 | 601 | 0.5% |
| Other values (6667) | 101546 |
| Value | Count | Frequency (%) |
| 0.00653 | 2 | |
| 0.00864 | 1 | < 0.1% |
| 0.01315 | 2 | |
| 0.01325 | 1 | < 0.1% |
| 0.01548 | 1 | < 0.1% |
| 0.01647 | 1 | < 0.1% |
| 0.0165 | 2 | |
| 0.01657 | 3 | |
| 0.01823 | 1 | < 0.1% |
| 0.01875 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.51229 | 1 | < 0.1% |
| 0.50633 | 1 | < 0.1% |
| 0.48873 | 1 | < 0.1% |
| 0.46201 | 1 | < 0.1% |
| 0.45857 | 2 | < 0.1% |
| 0.42395 | 1 | < 0.1% |
| 0.41355 | 55 | |
| 0.40831 | 2 | < 0.1% |
| 0.40745 | 4 | < 0.1% |
| 0.40679 | 11 | < 0.1% |
| Distinct | 2294 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1929457428 |
| Minimum | 0 |
|---|---|
| Maximum | 0.4975 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.082 |
| Q1 | 0.134 |
| median | 0.184 |
| Q3 | 0.2506 |
| 95-th percentile | 0.3177 |
| Maximum | 0.4975 |
| Range | 0.4975 |
| Interquartile range (IQR) | 0.1166 |
Descriptive statistics
| Standard deviation | 0.07491660314 |
|---|---|
| Coefficient of variation (CV) | 0.3882780831 |
| Kurtosis | -0.9115075572 |
| Mean | 0.1929457428 |
| Median Absolute Deviation (MAD) | 0.0579 |
| Skewness | 0.2723358036 |
| Sum | 21815.60335 |
| Variance | 0.005612497425 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3177 | 3672 | 3.2% |
| 0.35 | 1905 | 1.7% |
| 0.3199 | 1651 | 1.5% |
| 0.29 | 1508 | 1.3% |
| 0.2699 | 1314 | 1.2% |
| 0.15 | 1174 | 1.0% |
| 0.14 | 1022 | 0.9% |
| 0.1099 | 928 | 0.8% |
| 0.2 | 907 | 0.8% |
| 0.18 | 791 | 0.7% |
| Other values (2284) | 98194 |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 0.0001 | 1 | < 0.1% |
| 0.0005 | 1 | < 0.1% |
| 0.0021 | 1 | < 0.1% |
| 0.005 | 1 | < 0.1% |
| 0.0099 | 1 | < 0.1% |
| 0.01 | 11 | |
| 0.0115 | 1 | < 0.1% |
| 0.015 | 1 | < 0.1% |
| 0.0295 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.4975 | 2 | < 0.1% |
| 0.48 | 1 | < 0.1% |
| 0.45 | 3 | < 0.1% |
| 0.4 | 2 | < 0.1% |
| 0.375 | 1 | < 0.1% |
| 0.36 | 17 | < 0.1% |
| 0.3575 | 21 | < 0.1% |
| 0.357 | 2 | < 0.1% |
| 0.353 | 1 | < 0.1% |
| 0.35 | 1905 |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 84113 |
| Missing (%) | 74.4% |
| Memory size | 4.2 MiB |
| C | |
|---|---|
| D | |
| B | |
| AA | |
| HR | |
| Other values (3) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.247228267 |
| Min length | 1 |
Characters and Unicode
| Total characters | 36111 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | HR |
| 3rd row | C |
| 4th row | AA |
| 5th row | D |
Common Values
| Value | Count | Frequency (%) |
| C | 5649 | 5.0% |
| D | 5153 | 4.6% |
| B | 4389 | 3.9% |
| AA | 3509 | 3.1% |
| HR | 3508 | 3.1% |
| A | 3315 | 2.9% |
| E | 3289 | 2.9% |
| NC | 141 | 0.1% |
| (Missing) | 84113 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| c | 5649 | |
| d | 5153 | |
| b | 4389 | |
| aa | 3509 | |
| hr | 3508 | |
| a | 3315 | |
| e | 3289 | |
| nc | 141 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10333 | |
| C | 5790 | |
| D | 5153 | |
| B | 4389 | |
| H | 3508 | 9.7% |
| R | 3508 | 9.7% |
| E | 3289 | 9.1% |
| N | 141 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 36111 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10333 | |
| C | 5790 | |
| D | 5153 | |
| B | 4389 | |
| H | 3508 | 9.7% |
| R | 3508 | 9.7% |
| E | 3289 | 9.1% |
| N | 141 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36111 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10333 | |
| C | 5790 | |
| D | 5153 | |
| B | 4389 | |
| H | 3508 | 9.7% |
| R | 3508 | 9.7% |
| E | 3289 | 9.1% |
| N | 141 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36111 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10333 | |
| C | 5790 | |
| D | 5153 | |
| B | 4389 | |
| H | 3508 | 9.7% |
| R | 3508 | 9.7% |
| E | 3289 | 9.1% |
| N | 141 | 0.4% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 29084 |
| Missing (%) | 25.7% |
| Memory size | 5.5 MiB |
| C | |
|---|---|
| B | |
| A | |
| D | |
| E | |
| Other values (2) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.145769332 |
| Min length | 1 |
Characters and Unicode
| Total characters | 96224 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | D |
| 4th row | B |
| 5th row | E |
Common Values
| Value | Count | Frequency (%) |
| C | 18096 | |
| B | 15368 | |
| A | 14390 | |
| D | 14170 | |
| E | 9716 | 8.6% |
| HR | 6917 | 6.1% |
| AA | 5325 | 4.7% |
| (Missing) | 29084 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| c | 18096 | |
| b | 15368 | |
| a | 14390 | |
| d | 14170 | |
| e | 9716 | |
| hr | 6917 | 8.2% |
| aa | 5325 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 25040 | |
| C | 18096 | |
| B | 15368 | |
| D | 14170 | |
| E | 9716 | 10.1% |
| H | 6917 | 7.2% |
| R | 6917 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 96224 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 25040 | |
| C | 18096 | |
| B | 15368 | |
| D | 14170 | |
| E | 9716 | 10.1% |
| H | 6917 | 7.2% |
| R | 6917 | 7.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 96224 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 25040 | |
| C | 18096 | |
| B | 15368 | |
| D | 14170 | |
| E | 9716 | 10.1% |
| H | 6917 | 7.2% |
| R | 6917 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96224 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 25040 | |
| C | 18096 | |
| B | 15368 | |
| D | 14170 | |
| E | 9716 | 10.1% |
| H | 6917 | 7.2% |
| R | 6917 | 7.2% |
CreditScoreRangeLower
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 591 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 685.5249611 |
| Minimum | 0 |
|---|---|
| Maximum | 880 |
| Zeros | 133 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 560 |
| Q1 | 660 |
| median | 680 |
| Q3 | 720 |
| 95-th percentile | 780 |
| Maximum | 880 |
| Range | 880 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 66.63589474 |
|---|---|
| Coefficient of variation (CV) | 0.09720418441 |
| Kurtosis | 13.22534937 |
| Mean | 685.5249611 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | -1.587483879 |
| Sum | 77104420 |
| Variance | 4440.342467 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 680 | 16315 | |
| 660 | 16177 | |
| 700 | 15315 | |
| 720 | 12797 | |
| 640 | 12099 | |
| 740 | 9211 | |
| 760 | 6566 | |
| 780 | 4607 | 4.1% |
| 620 | 4172 | 3.7% |
| 600 | 3601 | 3.2% |
| Other values (16) | 11615 |
| Value | Count | Frequency (%) |
| 0 | 133 | 0.1% |
| 360 | 1 | < 0.1% |
| 420 | 5 | < 0.1% |
| 440 | 36 | < 0.1% |
| 460 | 141 | 0.1% |
| 480 | 346 | 0.3% |
| 500 | 554 | 0.5% |
| 520 | 1593 | |
| 540 | 1474 | |
| 560 | 1357 |
| Value | Count | Frequency (%) |
| 880 | 27 | < 0.1% |
| 860 | 212 | 0.2% |
| 840 | 567 | 0.5% |
| 820 | 1408 | 1.2% |
| 800 | 2636 | 2.3% |
| 780 | 4607 | 4.1% |
| 760 | 6566 | |
| 740 | 9211 | |
| 720 | 12797 | |
| 700 | 15315 |
CreditScoreRangeUpper
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 591 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 704.5249611 |
| Minimum | 19 |
|---|---|
| Maximum | 899 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 579 |
| Q1 | 679 |
| median | 699 |
| Q3 | 739 |
| 95-th percentile | 799 |
| Maximum | 899 |
| Range | 880 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 66.63589474 |
|---|---|
| Coefficient of variation (CV) | 0.0945827308 |
| Kurtosis | 13.22534937 |
| Mean | 704.5249611 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | -1.587483879 |
| Sum | 79241445 |
| Variance | 4440.342467 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 699 | 16315 | |
| 679 | 16177 | |
| 719 | 15315 | |
| 739 | 12797 | |
| 659 | 12099 | |
| 759 | 9211 | |
| 779 | 6566 | |
| 799 | 4607 | 4.1% |
| 639 | 4172 | 3.7% |
| 619 | 3601 | 3.2% |
| Other values (16) | 11615 |
| Value | Count | Frequency (%) |
| 19 | 133 | 0.1% |
| 379 | 1 | < 0.1% |
| 439 | 5 | < 0.1% |
| 459 | 36 | < 0.1% |
| 479 | 141 | 0.1% |
| 499 | 346 | 0.3% |
| 519 | 554 | 0.5% |
| 539 | 1593 | |
| 559 | 1474 | |
| 579 | 1357 |
| Value | Count | Frequency (%) |
| 899 | 27 | < 0.1% |
| 879 | 212 | 0.2% |
| 859 | 567 | 0.5% |
| 839 | 1408 | 1.2% |
| 819 | 2636 | 2.3% |
| 799 | 4607 | 4.1% |
| 779 | 6566 | |
| 759 | 9211 | |
| 739 | 12797 | |
| 719 | 15315 |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
| $25,000-49,999 | |
|---|---|
| $50,000-74,999 | |
| $100,000+ | |
| $75,000-99,999 | |
| Not displayed | |
| Other values (3) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 12.77107176 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1443974 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | $25,000-49,999 |
|---|---|
| 2nd row | $50,000-74,999 |
| 3rd row | Not displayed |
| 4th row | $25,000-49,999 |
| 5th row | $100,000+ |
Common Values
| Value | Count | Frequency (%) |
| $25,000-49,999 | 31940 | |
| $50,000-74,999 | 30749 | |
| $100,000+ | 17188 | |
| $75,000-99,999 | 16780 | |
| Not displayed | 7741 | 6.8% |
| $1-24,999 | 7241 | 6.4% |
| Not employed | 806 | 0.7% |
| $0 | 621 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 25,000-49,999 | 31940 | |
| 50,000-74,999 | 30749 | |
| 100,000 | 17188 | |
| 75,000-99,999 | 16780 | |
| not | 8547 | 7.0% |
| displayed | 7741 | 6.4% |
| 1-24,999 | 7241 | 6.0% |
| employed | 806 | 0.7% |
| 0 | 621 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 355717 | |
| 9 | 325630 | |
| , | 183367 | |
| $ | 104519 | 7.2% |
| - | 86710 | 6.0% |
| 5 | 79469 | 5.5% |
| 4 | 69930 | 4.8% |
| 7 | 47529 | 3.3% |
| 2 | 39181 | 2.7% |
| 1 | 24429 | 1.7% |
| Other values (14) | 127493 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 941885 | |
| Other Punctuation | 183367 | 12.7% |
| Currency Symbol | 104519 | 7.2% |
| Lowercase Letter | 93211 | 6.5% |
| Dash Punctuation | 86710 | 6.0% |
| Math Symbol | 17188 | 1.2% |
| Space Separator | 8547 | 0.6% |
| Uppercase Letter | 8547 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 16288 | |
| e | 9353 | |
| o | 9353 | |
| t | 8547 | |
| p | 8547 | |
| l | 8547 | |
| y | 8547 | |
| i | 7741 | |
| s | 7741 | |
| a | 7741 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 355717 | |
| 9 | 325630 | |
| 5 | 79469 | 8.4% |
| 4 | 69930 | 7.4% |
| 7 | 47529 | 5.0% |
| 2 | 39181 | 4.2% |
| 1 | 24429 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 183367 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 104519 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86710 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17188 |
Space Separator
| Value | Count | Frequency (%) |
| 8547 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 8547 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1342216 | |
| Latin | 101758 | 7.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 355717 | |
| 9 | 325630 | |
| , | 183367 | |
| $ | 104519 | 7.8% |
| - | 86710 | 6.5% |
| 5 | 79469 | 5.9% |
| 4 | 69930 | 5.2% |
| 7 | 47529 | 3.5% |
| 2 | 39181 | 2.9% |
| 1 | 24429 | 1.8% |
| Other values (2) | 25735 | 1.9% |
Latin
| Value | Count | Frequency (%) |
| d | 16288 | |
| e | 9353 | |
| o | 9353 | |
| t | 8547 | |
| p | 8547 | |
| l | 8547 | |
| y | 8547 | |
| N | 8547 | |
| i | 7741 | |
| s | 7741 | |
| Other values (2) | 8547 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1443974 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 355717 | |
| 9 | 325630 | |
| , | 183367 | |
| $ | 104519 | 7.2% |
| - | 86710 | 6.0% |
| 5 | 79469 | 5.5% |
| 4 | 69930 | 4.8% |
| 7 | 47529 | 3.3% |
| 2 | 39181 | 2.7% |
| 1 | 24429 | 1.7% |
| Other values (14) | 127493 | 8.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 110.5 KiB |
| True | |
|---|---|
| False | 8587 |
| Value | Count | Frequency (%) |
| True | 104479 | |
| False | 8587 | 7.6% |
| Distinct | 13502 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5605.11958 |
| Minimum | 0 |
|---|---|
| Maximum | 1750002.917 |
| Zeros | 1394 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1533 |
| Q1 | 3199.395833 |
| median | 4666.666667 |
| Q3 | 6824.6875 |
| 95-th percentile | 12250 |
| Maximum | 1750002.917 |
| Range | 1750002.917 |
| Interquartile range (IQR) | 3625.291667 |
Descriptive statistics
| Standard deviation | 7495.595563 |
|---|---|
| Coefficient of variation (CV) | 1.337276655 |
| Kurtosis | 26784.24094 |
| Mean | 5605.11958 |
| Median Absolute Deviation (MAD) | 1750 |
| Skewness | 125.0987676 |
| Sum | 633748450.4 |
| Variance | 56183952.84 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4166.666667 | 3486 | 3.1% |
| 5000 | 3367 | 3.0% |
| 3333.333333 | 2889 | 2.6% |
| 3750 | 2399 | 2.1% |
| 5416.666667 | 2351 | 2.1% |
| 5833.333333 | 2284 | 2.0% |
| 6250 | 2255 | 2.0% |
| 2500 | 2238 | 2.0% |
| 4583.333333 | 2186 | 1.9% |
| 6666.666667 | 2139 | 1.9% |
| Other values (13492) | 87472 |
| Value | Count | Frequency (%) |
| 0 | 1394 | |
| 0.083333 | 251 | 0.2% |
| 0.25 | 1 | < 0.1% |
| 0.833333 | 1 | < 0.1% |
| 1.416667 | 1 | < 0.1% |
| 1.666667 | 1 | < 0.1% |
| 1.833333 | 2 | < 0.1% |
| 1.916667 | 1 | < 0.1% |
| 2.083333 | 1 | < 0.1% |
| 2.166667 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1750002.917 | 1 | |
| 618547.8333 | 1 | |
| 483333.3333 | 1 | |
| 466666.6667 | 1 | |
| 416666.6667 | 1 | |
| 394400 | 1 | |
| 250000 | 1 | |
| 208333.3333 | 1 | |
| 185081.75 | 2 | |
| 158333.3333 | 1 |
| Distinct | 1207 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 8472 |
| Missing (%) | 7.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2760324777 |
| Minimum | 0 |
|---|---|
| Maximum | 10.01 |
| Zeros | 19 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.06 |
| Q1 | 0.14 |
| median | 0.22 |
| Q3 | 0.32 |
| 95-th percentile | 0.51 |
| Maximum | 10.01 |
| Range | 10.01 |
| Interquartile range (IQR) | 0.18 |
Descriptive statistics
| Standard deviation | 0.5537376038 |
|---|---|
| Coefficient of variation (CV) | 2.006059607 |
| Kurtosis | 259.5241037 |
| Mean | 0.2760324777 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 15.38596561 |
| Sum | 28871.34097 |
| Variance | 0.3066253338 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.18 | 4103 | 3.6% |
| 0.22 | 3662 | 3.2% |
| 0.17 | 3595 | 3.2% |
| 0.14 | 3531 | 3.1% |
| 0.2 | 3445 | 3.0% |
| 0.16 | 3409 | 3.0% |
| 0.19 | 3372 | 3.0% |
| 0.15 | 3321 | 2.9% |
| 0.21 | 3193 | 2.8% |
| 0.13 | 3152 | 2.8% |
| Other values (1197) | 69811 | |
| (Missing) | 8472 | 7.5% |
| Value | Count | Frequency (%) |
| 0 | 19 | < 0.1% |
| 0.00044 | 1 | < 0.1% |
| 0.0031 | 1 | < 0.1% |
| 0.00611 | 1 | < 0.1% |
| 0.00647 | 1 | < 0.1% |
| 0.00677 | 1 | < 0.1% |
| 0.00722 | 1 | < 0.1% |
| 0.01 | 250 | |
| 0.01042 | 1 | < 0.1% |
| 0.01051 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10.01 | 272 | |
| 9.77 | 1 | < 0.1% |
| 9.44 | 1 | < 0.1% |
| 9.2 | 1 | < 0.1% |
| 9.06 | 1 | < 0.1% |
| 8.63 | 1 | < 0.1% |
| 8.53 | 1 | < 0.1% |
| 8.52 | 1 | < 0.1% |
| 8.27 | 1 | < 0.1% |
| 8.13 | 1 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2255 |
| Missing (%) | 2.0% |
| Memory size | 7.0 MiB |
| Employed | |
|---|---|
| Full-time | |
| Self-employed | 6052 |
| Not available | 5347 |
| Other | 3742 |
| Other values (3) | 2718 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.68365054 |
| Min length | 5 |
Characters and Unicode
| Total characters | 962244 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Self-employed |
|---|---|
| 2nd row | Employed |
| 3rd row | Not available |
| 4th row | Employed |
| 5th row | Employed |
Common Values
| Value | Count | Frequency (%) |
| Employed | 66598 | |
| Full-time | 26354 | 23.3% |
| Self-employed | 6052 | 5.4% |
| Not available | 5347 | 4.7% |
| Other | 3742 | 3.3% |
| Part-time | 1088 | 1.0% |
| Not employed | 835 | 0.7% |
| Retired | 795 | 0.7% |
| (Missing) | 2255 | 2.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| employed | 67433 | |
| full-time | 26354 | 22.5% |
| not | 6182 | 5.3% |
| self-employed | 6052 | 5.2% |
| available | 5347 | 4.6% |
| other | 3742 | 3.2% |
| part-time | 1088 | 0.9% |
| retired | 795 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 142939 | |
| e | 124545 | |
| m | 100927 | |
| o | 79667 | |
| d | 74280 | |
| p | 73485 | |
| y | 73485 | |
| E | 66598 | |
| t | 39249 | 4.1% |
| i | 33584 | 3.5% |
| Other values (15) | 153485 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 811757 | |
| Uppercase Letter | 110811 | 11.5% |
| Dash Punctuation | 33494 | 3.5% |
| Space Separator | 6182 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 142939 | |
| e | 124545 | |
| m | 100927 | |
| o | 79667 | |
| d | 74280 | |
| p | 73485 | |
| y | 73485 | |
| t | 39249 | 4.8% |
| i | 33584 | 4.1% |
| u | 26354 | 3.2% |
| Other values (6) | 43242 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 66598 | |
| F | 26354 | 23.8% |
| N | 6182 | 5.6% |
| S | 6052 | 5.5% |
| O | 3742 | 3.4% |
| P | 1088 | 1.0% |
| R | 795 | 0.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 33494 |
Space Separator
| Value | Count | Frequency (%) |
| 6182 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 922568 | |
| Common | 39676 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 142939 | |
| e | 124545 | |
| m | 100927 | |
| o | 79667 | |
| d | 74280 | |
| p | 73485 | |
| y | 73485 | |
| E | 66598 | |
| t | 39249 | 4.3% |
| i | 33584 | 3.6% |
| Other values (13) | 113809 |
Common
| Value | Count | Frequency (%) |
| - | 33494 | |
| 6182 | 15.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 962244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 142939 | |
| e | 124545 | |
| m | 100927 | |
| o | 79667 | |
| d | 74280 | |
| p | 73485 | |
| y | 73485 | |
| E | 66598 | |
| t | 39249 | 4.1% |
| i | 33584 | 3.5% |
| Other values (15) | 153485 |
| Distinct | 605 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 7625 |
| Missing (%) | 6.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.06058364 |
| Minimum | 0 |
|---|---|
| Maximum | 755 |
| Zeros | 1503 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 883.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 26 |
| median | 67 |
| Q3 | 137 |
| 95-th percentile | 297 |
| Maximum | 755 |
| Range | 755 |
| Interquartile range (IQR) | 111 |
Descriptive statistics
| Standard deviation | 94.43224105 |
|---|---|
| Coefficient of variation (CV) | 0.9830487956 |
| Kurtosis | 2.72591678 |
| Mean | 96.06058364 |
| Median Absolute Deviation (MAD) | 48 |
| Skewness | 1.581477048 |
| Sum | 10128724 |
| Variance | 8917.448151 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1503 | 1.3% |
| 4 | 1177 | 1.0% |
| 1 | 1171 | 1.0% |
| 3 | 1166 | 1.0% |
| 5 | 1147 | 1.0% |
| 2 | 1144 | 1.0% |
| 7 | 1102 | 1.0% |
| 8 | 1097 | 1.0% |
| 6 | 1093 | 1.0% |
| 12 | 1071 | 0.9% |
| Other values (595) | 93770 | |
| (Missing) | 7625 | 6.7% |
| Value | Count | Frequency (%) |
| 0 | 1503 | |
| 1 | 1171 | |
| 2 | 1144 | |
| 3 | 1166 | |
| 4 | 1177 | |
| 5 | 1147 | |
| 6 | 1093 | |
| 7 | 1102 | |
| 8 | 1097 | |
| 9 | 1022 |
| Value | Count | Frequency (%) |
| 755 | 1 | |
| 745 | 1 | |
| 733 | 1 | |
| 732 | 1 | |
| 731 | 1 | |
| 690 | 1 | |
| 685 | 1 | |
| 678 | 1 | |
| 672 | 1 | |
| 662 | 1 |
IsBorrowerHomeowner
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 110.5 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 57052 | |
| False | 56014 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | ListingKey | ListingCreationDate | ClosedDate | LoanStatus | Term | LoanOriginalAmount | MonthlyLoanPayment | ListingCategory (numeric) | BorrowerAPR | BorrowerRate | CreditGrade | ProsperRating (Alpha) | CreditScoreRangeLower | CreditScoreRangeUpper | IncomeRange | IncomeVerifiable | StatedMonthlyIncome | DebtToIncomeRatio | EmploymentStatus | EmploymentStatusDuration | IsBorrowerHomeowner | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1021339766868145413AB3B | 2007-08-26 19:09:29.263000000 | 2009-08-14 00:00:00 | Completed | 36 | 9425 | 330.43 | 0 | 0.16516 | 0.1580 | C | NaN | 640.0 | 659.0 | $25,000-49,999 | True | 3083.333333 | 0.17 | Self-employed | 2.0 | True |
| 1 | 1 | 10273602499503308B223C1 | 2014-02-27 08:28:07.900000000 | NaN | Current | 36 | 10000 | 318.93 | 2 | 0.12016 | 0.0920 | NaN | A | 680.0 | 699.0 | $50,000-74,999 | True | 6125.000000 | 0.18 | Employed | 44.0 | False |
| 2 | 2 | 0EE9337825851032864889A | 2007-01-05 15:00:47.090000000 | 2009-12-17 00:00:00 | Completed | 36 | 3001 | 123.32 | 0 | 0.28269 | 0.2750 | HR | NaN | 480.0 | 499.0 | Not displayed | True | 2083.333333 | 0.06 | Not available | NaN | False |
| 3 | 3 | 0EF5356002482715299901A | 2012-10-22 11:02:35.010000000 | NaN | Current | 36 | 10000 | 321.45 | 16 | 0.12528 | 0.0974 | NaN | A | 800.0 | 819.0 | $25,000-49,999 | True | 2875.000000 | 0.15 | Employed | 113.0 | True |
| 4 | 4 | 0F023589499656230C5E3E2 | 2013-09-14 18:38:39.097000000 | NaN | Current | 36 | 15000 | 563.97 | 2 | 0.24614 | 0.2085 | NaN | D | 680.0 | 699.0 | $100,000+ | True | 9583.333333 | 0.26 | Employed | 44.0 | True |
| 5 | 5 | 0F05359734824199381F61D | 2013-12-14 08:26:37.093000000 | NaN | Current | 60 | 15000 | 342.37 | 1 | 0.15425 | 0.1314 | NaN | B | 740.0 | 759.0 | $100,000+ | True | 8333.333333 | 0.36 | Employed | 82.0 | True |
| 6 | 6 | 0F0A3576754255009D63151 | 2013-04-12 09:52:56.147000000 | NaN | Current | 36 | 3000 | 122.67 | 1 | 0.31032 | 0.2712 | NaN | E | 680.0 | 699.0 | $25,000-49,999 | True | 2083.333333 | 0.27 | Employed | 172.0 | False |
| 7 | 7 | 0F1035772717087366F9EA7 | 2013-05-05 06:49:27.493000000 | NaN | Current | 36 | 10000 | 372.60 | 2 | 0.23939 | 0.2019 | NaN | C | 700.0 | 719.0 | $25,000-49,999 | True | 3355.750000 | 0.24 | Employed | 103.0 | False |
| 8 | 8 | 0F043596202561788EA13D5 | 2013-12-02 10:43:39.117000000 | NaN | Current | 36 | 10000 | 305.54 | 7 | 0.07620 | 0.0629 | NaN | AA | 820.0 | 839.0 | $25,000-49,999 | True | 3333.333333 | 0.25 | Employed | 269.0 | True |
| 9 | 10 | 0F123545674891886D9F106 | 2012-05-10 07:04:01.577000000 | NaN | Current | 60 | 13500 | 395.37 | 1 | 0.27462 | 0.2489 | NaN | C | 640.0 | 659.0 | $75,000-99,999 | True | 7500.000000 | 0.12 | Employed | 300.0 | False |
Last rows
| df_index | ListingKey | ListingCreationDate | ClosedDate | LoanStatus | Term | LoanOriginalAmount | MonthlyLoanPayment | ListingCategory (numeric) | BorrowerAPR | BorrowerRate | CreditGrade | ProsperRating (Alpha) | CreditScoreRangeLower | CreditScoreRangeUpper | IncomeRange | IncomeVerifiable | StatedMonthlyIncome | DebtToIncomeRatio | EmploymentStatus | EmploymentStatusDuration | IsBorrowerHomeowner | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 113056 | 113927 | E3433419834735803891976 | 2008-04-30 21:25:19.670000000 | 2011-05-09 00:00:00 | Completed | 36 | 4292 | 132.11 | 4 | 0.07469 | 0.0679 | AA | NaN | 760.0 | 779.0 | $100,000+ | True | 10333.333333 | 0.06 | Full-time | 69.0 | True |
| 113057 | 113928 | E34935176664905343E01EA | 2011-06-06 19:02:44.443000000 | 2011-09-19 00:00:00 | Completed | 36 | 2000 | 73.30 | 3 | 0.22362 | 0.1899 | NaN | C | 740.0 | 759.0 | $25,000-49,999 | True | 2333.333333 | 0.27 | Full-time | 22.0 | False |
| 113058 | 113929 | E3553583161337791FCB87F | 2013-07-06 17:40:01.657000000 | 2014-02-07 00:00:00 | Completed | 36 | 2500 | 101.25 | 2 | 0.30285 | 0.2639 | NaN | E | 660.0 | 679.0 | $50,000-74,999 | True | 4333.333333 | 0.05 | Employed | 25.0 | False |
| 113059 | 113930 | E35D3584034795373BCD69A | 2013-07-08 10:24:49.700000000 | NaN | Current | 36 | 3000 | 106.05 | 1 | 0.20053 | 0.1639 | NaN | B | 680.0 | 699.0 | $75,000-99,999 | True | 6250.000000 | 0.20 | Employed | 85.0 | True |
| 113060 | 113931 | E36F36005339663245C20F8 | 2014-01-16 20:13:08.040000000 | NaN | Current | 60 | 25000 | 565.50 | 3 | 0.15016 | 0.1274 | NaN | B | 800.0 | 819.0 | $75,000-99,999 | True | 8146.666667 | 0.28 | Employed | 12.0 | False |
| 113061 | 113932 | E6D9357655724827169606C | 2013-04-14 05:55:02.663000000 | NaN | Current | 36 | 10000 | 364.74 | 1 | 0.22354 | 0.1864 | NaN | C | 700.0 | 719.0 | $50,000-74,999 | True | 4333.333333 | 0.13 | Employed | 246.0 | True |
| 113062 | 113933 | E6DB353036033497292EE43 | 2011-11-03 20:42:55.333000000 | NaN | FinalPaymentInProgress | 36 | 2000 | 65.57 | 7 | 0.13220 | 0.1110 | NaN | A | 700.0 | 719.0 | $75,000-99,999 | True | 8041.666667 | 0.11 | Employed | 21.0 | True |
| 113063 | 113934 | E6E13596170052029692BB1 | 2013-12-13 05:49:12.703000000 | NaN | Current | 60 | 10000 | 273.35 | 1 | 0.23984 | 0.2150 | NaN | D | 700.0 | 719.0 | $25,000-49,999 | True | 2875.000000 | 0.51 | Employed | 84.0 | True |
| 113064 | 113935 | E6EB3531504622671970D9E | 2011-11-14 13:18:26.597000000 | 2013-08-13 00:00:00 | Completed | 60 | 15000 | 449.55 | 2 | 0.28408 | 0.2605 | NaN | C | 680.0 | 699.0 | $25,000-49,999 | True | 3875.000000 | 0.48 | Full-time | 94.0 | True |
| 113065 | 113936 | E6ED3600409833199F711B7 | 2014-01-15 09:27:37.657000000 | NaN | Current | 36 | 2000 | 64.90 | 1 | 0.13189 | 0.1039 | NaN | A | 680.0 | 699.0 | $50,000-74,999 | True | 4583.333333 | 0.23 | Employed | 244.0 | False |